4 <110> APPLICANT: Thibeault, Diane
5 Lamarre, Daniel
6 Maurice, Roger
7 Pilote, Louise
8 Pause, Armin

10 <120> TITLE OF INVENTION: Purified Active HCV NS2/3 Protease

13 <130> FILE REFERENCE: 13/082
C--> 15 <140> CURRENT APPLICATION NUMBER: US/10/017,736C
C--> 15 <141> CURRENT FILING DATE: 2001-12-14
15 <150> PRIOR APPLICATION NUMBER: 60/256,031
16 <151> PRIOR FILING DATE: 2000-12-15

18 <160> NUMBER OF SEQ ID NOS: 25

20 <170> SOFTWARE: FastSEQ for Windows Version 4.0










22 <210> SEQ ID NO: 1
23 <211> LENGTH: 1230
24 <212> TYPE: DNA
25 <213> ORGANISM: HCV

27 <220> FEATURE:
28 <221> NAME/KEY: CDS
29 <222> LOCATION: (1)...(1230)

31 <400> SEQUENCE: 1
























32 atg gac cgg gag atg gct gca tcg tgc gga ggc gcg gtt ttc ata ggt 48
33 Met Asp Arg Glu Met Ala Ala Ser Cys Gly Gly Ala Val Phe Ile Gly
34 1 5 10 15

36 ctt gca ctc ttg acc ttg tca cca tac tat aaa gtg ctc ctc gct agg 96
37 Leu Ala Leu Leu Thr Leu Ser Pro Tyr Tyr Lys Val Leu Leu Ala Arg
38 20 25 30

40 ctc ata tgg tgg tta cag tat tta atc acc aga gtc gag gcg cac ttg 144
41 Leu Ile Trp Trp Leu Gln Tyr Leu Ile Thr Arg Val Glu Ala His Leu
42 35 40 45

44 caa gtg tgg atc ccc cct ctc aat gtt cgg gga ggc cgc gat gcc atc 192
45 Gln Val Trp Ile Pro Pro Leu Asn Val Arg Gly Gly Arg Asp Ala Ile
46 50 55 60

48 atc ctc ctc acg tgc gca gtc cac cca gag cta atc ttt gac atc acc 240
49 Ile Leu Leu Thr Cys Ala Val His Pro Glu Leu Ile Phe Asp Ile Thr
50 65 70 75 80

52 aaa ctc ctg ctc gcc ata ttc ggt ccg ctc atg gtg ctc cag gca ggc 288
53 Lys Leu Leu Leu Ala Ile Phe Gly Pro Leu Met Val Leu Gln Ala Gly
54 85 90 95

56 ata acc aaa gtg ccg tac ttc gtg cgt gcg cag ggg ctc att cgt gcg 336
57 Ile Thr Lys Val Pro Tyr Phe Val Arg Ala Gln Gly Leu Ile Arg Ala
58 100 105 110

60 tgt atg ttg gtg cgg aag gct gcg ggg ggt cat tat gtc caa atg gcc 384
61 Cys Met Leu Val Arg Lys Ala Ala Gly Gly His Tyr Val Gln Met Ala
62 115 120 125

64 ttc atg aag cta gct gcg ctg aca ggt acg tac gtt tat gac cat ctc 432
65 Phe Met Lys Leu Ala Ala Leu Thr Gly Thr Tyr Val Tyr Asp His Leu
66 130 135 140

68 act cca ttg cag gat tgg gcc cac gcg ggc cta cga gac ctt gca gtg 480
69 Thr Pro Leu Gln Asp Trp Ala His Ala Gly Leu Arg Asp Leu Ala Val
70 145 150 155 160

72 gcg gta gag ccc gtc atc ttc tct gac atg gag gtc aag atc atc acc 528
73 Ala Val Glu Pro Val Ile Phe Ser Asp Met Glu Val Lys Ile Ile Thr
74 165 170 175

76 tgg ggg gcg gac acc gcg gca tgc ggg gac atc att tca ggt ctg ccc 576
77 Trp Gly Ala Asp Thr Ala Ala Cys Gly Asp Ile Ile Ser Gly Leu Pro
78 180 185 190

80 gtc tcc gct cga agg gga agg gag ata ctc ctg gga ccg gcc gat aat 624
81 Val Ser Ala Arg Arg Gly Arg Glu Ile Leu Leu Gly Pro Ala Asp Asn
82 195 200 205

84 ttt gaa ggg cag ggg tgg cga ctc ctt gcg ccc atc acg gcc tac tcc 672
85 Phe Glu Gly Gln Gly Trp Arg Leu Leu Ala Pro Ile Thr Ala Tyr Ser
86 210 215 220

88 caa cag aca cgg ggc cta ctt ggt tgc atc atc acc agc ctc aca ggc 720
89 Gln Gln Thr Arg Gly Leu Leu Gly Cys Ile Ile Thr Ser Leu Thr Gly
90 225 230 235 240

92 cgg gac aag aac cag gtc gag ggg gag gtt caa gtg gtc tcc acc gct 768
93 Arg Asp Lys Asn Gln Val Glu Gly Glu Val Gln Val Val Ser Thr Ala
94 245 250 255

96 aca caa tct ttc ctg gcg acc tgc gtc aac ggc gtg tgt tgg act gtc 816
97 Thr Gln Ser Phe Leu Ala Thr Cys Val Asn Gly Val Cys Trp Thr Val
98 260 265 270

100 ttc cat ggc gcc ggc tca aag acc ttg gcc ggc ccc aaa ggc cca atc 864
101 Phe His Gly Ala Gly Ser Lys Thr Leu Ala Gly Pro Lys Gly Pro Ile
102 275 280 285

104 acc cag atg tac act aat gtg gac cag gac ctc gtc ggc tgg cag gcg 912
105 Thr Gln Met Tyr Thr Asn Val Asp Gln Asp Leu Val Gly Trp Gln Ala
106 290 295 300

108 ccc cct ggg gcg cgc tcc atg aca cca tgc acc tgc ggc agc tcg gac 960
109 Pro Pro Gly Ala Arg Ser Met Thr Pro Cys Thr Cys Gly Ser Ser Asp
110 305 310 315 320

113 ctc tat ttg gtc acg aga cat gcc gac gtc att ccg gtg cgc cgg cgg 1008
114 Leu Tyr Leu Val Thr Arg His Ala Asp Val Ile Pro Val Arg Arg Arg
115 325 330 335

117 ggc gac agt agg ggg agc ctg ctc tcc ccc agg cct gtc tcc tac ttg 1056
118 Gly Asp Ser Arg Gly Ser Leu Leu Ser Pro Arg Pro Val Ser Tyr Leu
119 340 345 350

121 aag ggc tct tcg ggt ggc cca ctg ctc tgc cct tcg ggg cac gct gtg 1104
122 Lys Gly Ser Ser Gly Gly Pro Leu Leu Cys Pro Ser Gly His Ala Val
123 355 360 365

125 ggc atc ttc cgg gct gct gtg tgc acc cgg ggg gtt gca aaa gcg gtg 1152
126 Gly Ile Phe Arg Ala Ala Val Cys Thr Arg Gly Val Ala Lys Ala Val
127 370 375 380

129 gac ttc ata cct gtt gag tct atg gaa act acc atg cgg act agt agc 1200
130 Asp Phe Ile Pro Val Glu Ser Met Glu Thr Thr Met Arg Thr Ser Ser
131 385 390 395 400

133 gct tgg cgt cac ccg cag ttc ggt ggt taa 1230
134 Ala Trp Arg His Pro Gln Phe Gly Gly *
135 405

138 <210> SEQ ID NO: 2
139 <211> LENGTH: 409
140 <212> TYPE: PRT
141 <213> ORGANISM:

143 <400> SEQUENCE: 2
144 Met Asp Arg Glu Met Ala Ala Ser Cys Gly Gly Ala Val Phe Ile Gly
145 1 5 10 15

147 Leu Ala Leu Leu Thr Leu Ser Pro Tyr Tyr Lys Val Leu Leu Ala Arg
148 20 25 30

150 Leu Ile Trp Trp Leu Gln Tyr Leu Ile Thr Arg Val Glu Ala His Leu
151 35 40 45

153 Gln Val Trp Ile Pro Pro Leu Asn Val Arg Gly Gly Arg Asp Ala Ile
154 50 55 60

156 Ile Leu Leu Thr Cys Ala Val His Pro Glu Leu Ile Phe Asp Ile Thr
157 65 70 75 80

159 Lys Leu Leu Leu Ala Ile Phe Gly Pro Leu Met Val Leu Gln Ala Gly
160 85 90 95

162 Ile Thr Lys Val Pro Tyr Phe Val Arg Ala Gln Gly Leu Ile Arg Ala
163 100 105 110

165 Cys Met Leu Val Arg Lys Ala Ala Gly Gly His Tyr Val Gln Met Ala
166 115 120 125

168 Phe Met Lys Leu Ala Ala Leu Thr Gly Thr Tyr Val Tyr Asp His Leu
169 130 135 140

171 Thr Pro Leu Gln Asp Trp Ala His Ala Gly Leu Arg Asp Leu Ala Val
172 145 150 155 160

174 Ala Val Glu Pro Val Ile Phe Ser Asp Met Glu Val Lys Ile Ile Thr
175 165 170 175

177 Trp Gly Ala Asp Thr Ala Ala Cys Gly Asp Ile Ile Ser Gly Leu Pro
178 180 185 190

180 Val Ser Ala Arg Arg Gly Arg Glu Ile Leu Leu Gly Pro Ala Asp Asn
181 195 200 205

183 Phe Glu Gly Gln Gly Trp Arg Leu Leu Ala Pro Ile Thr Ala Tyr Ser
184 210 215 220

186 Gln Gln Thr Arg Gly Leu Leu Gly Cys Ile Ile Thr Ser Leu Thr Gly
187 225 230 235 240

189 Arg Asp Lys Asn Gln Val Glu Gly Glu Val Gln Val Val Ser Thr Ala
190 245 250 255

192 Thr Gln Ser Phe Leu Ala Thr Cys Val Asn Gly Val Cys Trp Thr Val
193 260 265 270

195 Phe His Gly Ala Gly Ser Lys Thr Leu Ala Gly Pro Lys Gly Pro Ile
196 275 280 285

198 Thr Gln Met Tyr Thr Asn Val Asp Gln Asp Leu Val Gly Trp Gln Ala
199 290 295 300

201 Pro Pro Gly Ala Arg Ser Met Thr Pro Cys Thr Cys Gly Ser Ser Asp
202 305 310 315 320

204 Leu Tyr Leu Val Thr Arg His Ala Asp Val Ile Pro Val Arg Arg Arg
205 325 330 335

207 Gly Asp Ser Arg Gly Ser Leu Leu Ser Pro Arg Pro Val Ser Tyr Leu
208 340 345 350

210 Lys Gly Ser Ser Gly Gly Pro Leu Leu Cys Pro Ser Gly His Ala Val
211 355 360 365

213 Gly Ile Phe Arg Ala Ala Val Cys Thr Arg Gly Val Ala Lys Ala Val
214 370 375 380

216 Asp Phe Ile Pro Val Glu Ser Met Glu Thr Thr Met Arg Thr Ser Ser
217 385 390 395 400

219 Ala Trp Arg His Pro Gln Phe Gly Gly
220 405


























223 <210> SEQ ID NO: 3
224 <211> LENGTH: 1011
225 <212> TYPE: DNA
226 <213> ORGANISM: HCV

228 <220> FEATURE:
229 <221> NAME/KEY: CDS
230 <222> LOCATION: (1)...(1005)

232 <400> SEQUENCE: 3
233


atg 


aaa 


aag 


aaa 


aag 


etc 


gag 


ex. i_ 




rat 




cat 


cac 


act 


agt 


gca 


48 


234 Met Lys Lys Lys Lys Leu Glu His His His His His His Thr Ser Ala
235 1 5 10 15

237 ggc ata acc aaa gtg ccg tac ttc gtg cgt gcg cag ggg ctc att cgt 96
238 Gly Ile Thr Lys Val Pro Tyr Phe Val Arg Ala Gln Gly Leu Ile Arg
239 20 25 30

241 gcg tgt atg ttg gtg cgg aag gct gcg ggg ggt cat tat gtc caa atg 144
242 Ala Cys Met Leu Val Arg Lys Ala Ala Gly Gly His Tyr Val Gln Met
243 35 40 45

245 gcc ttc atg aag cta gct gcg ctg aca ggt acg tac gtt tat gac cat 192
246 Ala Phe Met Lys Leu Ala Ala Leu Thr Gly Thr Tyr Val Tyr Asp His
247 50 55 60

249 ctc act cca ttg cag gat tgg gcc cac gcg ggc cta cga gac ctt gca 240
250 Leu Thr Pro Leu Gln Asp Trp Ala His Ala Gly Leu Arg Asp Leu Ala
251 65 70 75 80

253 gtg gcg gta gag ccc gtc atc ttc tct gac atg gag gtc aag atc atc 288
254 Val Ala Val Glu Pro Val Ile Phe Ser Asp Met Glu Val Lys Ile Ile
255 85 90 95

257 acc tgg ggg gcg gac acc gcg gca tgc ggg gac atc att tca ggt ctg 336
258 Thr Trp Gly Ala Asp Thr Ala Ala Cys Gly Asp Ile Ile Ser Gly Leu
259 100 105 110

261 ccc gtc tcc gct cga agg gga agg gag ata ctc ctg gga ccg gcc gat 384
262 Pro Val Ser Ala Arg Arg Gly Arg Glu Ile Leu Leu Gly Pro Ala Asp
263 115 120 125

265 aat ttt gaa ggg cag ggg tgg cga ctc ctt gcg ccc atc acg gcc tac 432
266 Asn Phe Glu Gly Gln Gly Trp Arg Leu Leu Ala Pro Ile Thr Ala Tyr
267 130 135 140

269 tcc caa cag aca cgg ggc cta ctt ggt tgc atc atc acc agc ctc aca 480
270 Ser Gln Gln Thr Arg Gly Leu Leu Gly Cys Ile Ile Thr Ser Leu Thr
271 145 150 155 160

273 ggc cgg gac aag aac cag gtc gag ggg gag gtt caa gtg gtc tcc acc 528
274 Gly Arg Asp Lys Asn Gln Val Glu Gly Glu Val Gln Val Val Ser Thr
275 165 170 175

277 gct aca caa tct ttc ctg gcg acc tgc gtc aac ggc gtg tgt tgg act 576
278 Ala Thr Gln Ser Phe Leu Ala Thr Cys Val Asn Gly Val Cys Trp Thr
279 180 185 190

281 gtc ttc cat ggc gcc ggc tca aag acc ttg gcc ggc ccc aaa ggc cca 624
282 Val Phe His Gly Ala Gly Ser Lys Thr Leu Ala Gly Pro Lys Gly Pro
283 195 200 205

285 atc acc cag atg tac act aat gtg gac cag gac ctc gtc ggc tgg cag 672
286 Ile Thr Gln Met Tyr Thr Asn Val Asp Gln Asp Leu Val Gly Trp Gln
287 210 215 220

289 gcg ccc cct ggg gcg cgc tcc atg aca cca tgc acc tgc ggc agc tcg 720
290 Ala Pro Pro Gly Ala Arg Ser Met Thr Pro Cys Thr Cys Gly Ser Ser
291 225 230 235 240

293 gac ctc tat ttg gtc acg aga cat gcc gac gtc att ccg gtg cgc cgg 768
294 Asp Leu Tyr Leu Val Thr Arg His Ala Asp Val Ile Pro Val Arg Arg
295 245 250 255

297 cgg ggc gac agt agg ggg agc ctg ctc tcc ccc agg cct gtc tcc tac 816
298 Arg Gly Asp Ser Arg Gly Ser Leu Leu Ser Pro Arg Pro Val Ser Tyr
299 260 265 270

301 ttg aag ggc tct tcg ggt ggc cca ctg ctc tgc cct tcg ggg cac gct 864
302 Leu Lys Gly Ser Ser Gly Gly Pro Leu Leu Cys Pro Ser Gly His Ala
303 275 280 285

305 gtg ggc atc ttc cgg gct gct gtg tgc acc cgg ggg gtt gca aaa gcg 912
306 Val Gly Ile Phe Arg Ala Ala Val Cys Thr Arg Gly Val Ala Lys Ala
307 290 295 300

309 gtg gac ttc ata cct gtt gag tct atg gaa act acc atg cgg act agt 960
310 Val Asp Phe Ile Pro Val Glu Ser Met Glu Thr Thr Met Arg Thr Ser
311 305 310 315 320

313 agc gct tgg cgt cac ccg cag ttc ggt ggt aaa aag aaa aag taa 1005
314 Ser Ala Trp Arg His Pro Gln Phe Gly Gly Lys Lys Lys Lys *
315 325 330

317 ggatcc 1011


320 <210> SEQ ID NO: 4
321 <211> LENGTH: 334
322 <212> TYPE: PRT
323 <213> ORGANISM: HCV

325 <400> SEQUENCE: 4
326 Met Lys Lys Lys Lys Leu Glu His His His His His His Thr Ser Ala
327 1 5 10 15

329 Gly Ile Thr Lys Val Pro Tyr Phe Val Arg Ala Gln Gly Leu Ile Arg
330 20 25 30

332 Ala Cys Met Leu Val Arg Lys Ala Ala Gly Gly His Tyr Val Gln Met
333 35 40 45
RAW SEQUENCE LISTING ERROR SUMMARY

PATENT APPLICATION: US/10/017,736C



DATE: 02/12/2004 
TIME: 17:18:51 



Input Set : A:\PTO.AMC.txt 

Output Set: N:\CRF4\02122004\J017736C.raw 



Please Note: 

Use of n and/or Xaa has been detected in the Sequence Listing. Please review the
Sequence Listing to ensure that a corresponding explanation is presented in the
<220> to <223> fields of each sequence which presents at least one n or Xaa.

Seq#:19; Xaa Pos. 6 
Seq#:20; Xaa Pos. 6 
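
The check described in the note above can be sketched in Python. This is only a minimal illustration under stated assumptions, not the checker that produced this report: the function name `find_unexplained_unknowns` is hypothetical, records are assumed to start at a `<210>` tag, and residues are assumed to follow the `<400>` tag.

```python
import re


def find_unexplained_unknowns(listing: str) -> list[int]:
    """Return SEQ ID numbers that use n or Xaa without any <220>-<223> field.

    Hypothetical helper: the parsing is deliberately simplified.
    """
    flagged = []
    # Split the listing into records, each beginning at a <210> tag.
    for record in re.split(r"(?=<210>)", listing):
        id_match = re.match(r"<210>\D*(\d+)", record)
        if not id_match:
            continue  # text before the first record
        header, _, body = record.partition("<400>")
        tokens = body.split()
        # Unknown residue: the token Xaa, or an n inside a lowercase base group.
        has_unknown = "Xaa" in tokens or any(
            re.fullmatch(r"[acgtun]+", t) and "n" in t for t in tokens
        )
        # An explanation is expected somewhere in the <220> to <223> fields.
        has_feature = re.search(r"<22[0-3]>", header) is not None
        if has_unknown and not has_feature:
            flagged.append(int(id_match.group(1)))
    return flagged
```

Run against the full listing text, a function like this would be expected to report the sequences named above, provided they really lack `<220>` to `<223>` fields.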






VERIFICATION SUMMARY 

PATENT APPLICATION: US/10/017,736C



DATE: 02/12/2004 
TIME: 17:18:51 



Input Set : A:\PTO.AMC.txt 

Output Set: N:\CRF4\02122004\J017736C.raw 



L:15 M:270 C: Current Application Number differs, Replaced Current Application No 
L:15 M:271 C: Current Filing Date differs, Replaced Current Filing Date 
L:1077 M:258 W: Mandatory Feature missing, <220> Tag not found for SEQ ID#:19 
L:1080 M:258 W: Mandatory Feature missing, <220> Tag not found for SEQ ID#:19 
L:1084 M:258 W: Mandatory Feature missing, <220> Tag not found for SEQ ID#:19 
L:1085 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:19 after pos.:0 
L:1103 M:258 W: Mandatory Feature missing, <220> Tag not found for SEQ ID#:20 
L:1104 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:20 after pos.:0



4 <110> APPLICANT: Thibeault, Diane
5 Lamarre, Daniel
6 Maurice, Roger
7 Pilote, Louise
8 Pause, Armin

10 <120> TITLE OF INVENTION: Purified Active HCV NS2/3 Protease

13 <130> FILE REFERENCE: 13/082
C--> 15 <140> CURRENT APPLICATION NUMBER: US/10/017,736C
C--> 15 <141> CURRENT FILING DATE: 2001-12-14
15 <150> PRIOR APPLICATION NUMBER: 60/256,031
16 <151> PRIOR FILING DATE: 2000-12-15

18 <160> NUMBER OF SEQ ID NOS: 25

20 <170> SOFTWARE: FastSEQ for Windows Version 4.0



ERRORED SEQUENCES




















22 <210> SEQ ID NO: 1
23 <211> LENGTH: 1230
24 <212> TYPE: DNA
25 <213> ORGANISM: HCV

27 <220> FEATURE:
28 <221> NAME/KEY: CDS
29 <222> LOCATION: (1)...(1230)

31 <400> SEQUENCE: 1


















32 atg gac cgg gag atg gct gca tcg tgc gga ggc gcg gtt ttc ata ggt 48
33 Met Asp Arg Glu Met Ala Ala Ser Cys Gly Gly Ala Val Phe Ile Gly
34 1 5 10 15

36 ctt gca ctc ttg acc ttg tca cca tac tat aaa gtg ctc ctc gct agg 96
37 Leu Ala Leu Leu Thr Leu Ser Pro Tyr Tyr Lys Val Leu Leu Ala Arg
38 20 25 30

40 ctc ata tgg tgg tta cag tat tta atc acc aga gtc gag gcg cac ttg 144
41 Leu Ile Trp Trp Leu Gln Tyr Leu Ile Thr Arg Val Glu Ala His Leu
42 35 40 45

44 caa gtg tgg atc ccc cct ctc aat gtt cgg gga ggc cgc gat gcc atc 192
45 Gln Val Trp Ile Pro Pro Leu Asn Val Arg Gly Gly Arg Asp Ala Ile
46 50 55 60

48 atc ctc ctc acg tgc gca gtc cac cca gag cta atc ttt gac atc acc 240
49 Ile Leu Leu Thr Cys Ala Val His Pro Glu Leu Ile Phe Asp Ile Thr
50 65 70 75 80

52 aaa ctc ctg ctc gcc ata ttc ggt ccg ctc atg gtg ctc cag gca ggc 288
53 Lys Leu Leu Leu Ala Ile Phe Gly Pro Leu Met Val Leu Gln Ala Gly
54 85 90 95

56 ata acc aaa gtg ccg tac ttc gtg cgt gcg cag ggg ctc att cgt gcg 336
57 Ile Thr Lys Val Pro Tyr Phe Val Arg Ala Gln Gly Leu Ile Arg Ala
58 100 105 110

60 tgt atg ttg gtg cgg aag gct gcg ggg ggt cat tat gtc caa atg gcc 384
61 Cys Met Leu Val Arg Lys Ala Ala Gly Gly His Tyr Val Gln Met Ala
62 115 120 125

64 ttc atg aag cta gct gcg ctg aca ggt acg tac gtt tat gac cat ctc 432
65 Phe Met Lys Leu Ala Ala Leu Thr Gly Thr Tyr Val Tyr Asp His Leu
66 130 135 140

68 act cca ttg cag gat tgg gcc cac gcg ggc cta cga gac ctt gca gtg 480
69 Thr Pro Leu Gln Asp Trp Ala His Ala Gly Leu Arg Asp Leu Ala Val
70 145 150 155 160

72 gcg gta gag ccc gtc atc ttc tct gac atg gag gtc aag atc atc acc 528
73 Ala Val Glu Pro Val Ile Phe Ser Asp Met Glu Val Lys Ile Ile Thr
74 165 170 175

76 tgg ggg gcg gac acc gcg gca tgc ggg gac atc att tca ggt ctg ccc 576
77 Trp Gly Ala Asp Thr Ala Ala Cys Gly Asp Ile Ile Ser Gly Leu Pro
78 180 185 190

80 gtc tcc gct cga agg gga agg gag ata ctc ctg gga ccg gcc gat aat 624
81 Val Ser Ala Arg Arg Gly Arg Glu Ile Leu Leu Gly Pro Ala Asp Asn
82 195 200 205

84 ttt gaa ggg cag ggg tgg cga ctc ctt gcg ccc atc acg gcc tac tcc 672
85 Phe Glu Gly Gln Gly Trp Arg Leu Leu Ala Pro Ile Thr Ala Tyr Ser
86 210 215 220

88 caa cag aca cgg ggc cta ctt ggt tgc atc atc acc agc ctc aca ggc 720
89 Gln Gln Thr Arg Gly Leu Leu Gly Cys Ile Ile Thr Ser Leu Thr Gly
90 225 230 235 240

92 cgg gac aag aac cag gtc gag ggg gag gtt caa gtg gtc tcc acc gct 768
93 Arg Asp Lys Asn Gln Val Glu Gly Glu Val Gln Val Val Ser Thr Ala
94 245 250 255

96 aca caa tct ttc ctg gcg acc tgc gtc aac ggc gtg tgt tgg act gtc 816
97 Thr Gln Ser Phe Leu Ala Thr Cys Val Asn Gly Val Cys Trp Thr Val
98 260 265 270

100 ttc cat ggc gcc ggc tca aag acc ttg gcc ggc ccc aaa ggc cca atc 864
101 Phe His Gly Ala Gly Ser Lys Thr Leu Ala Gly Pro Lys Gly Pro Ile
102 275 280 285

104 acc cag atg tac act aat gtg gac cag gac ctc gtc ggc tgg cag gcg 912
105 Thr Gln Met Tyr Thr Asn Val Asp Gln Asp Leu Val Gly Trp Gln Ala
106 290 295 300

108 ccc cct ggg gcg cgc tcc atg aca cca tgc acc tgc ggc agc tcg gac 960
109 Pro Pro Gly Ala Arg Ser Met Thr Pro Cys Thr Cys Gly Ser Ser Asp
110 305 310 315 320

113 ctc tat ttg gtc acg aga cat gcc gac gtc att ccg gtg cgc cgg cgg 1008
114 Leu Tyr Leu Val Thr Arg His Ala Asp Val Ile Pro Val Arg Arg Arg
115 325 330 335

W--> 117 ggc gac agt agg ggg agc ctg ctc tcc ccc agg cct gtc tcc tac ttg 1056 Gly Asp Ser Arg Gly Ser
W--> 118 340 345 350

E--> 120 aag ggc tct tcg ggt ggc cca ctg ctc tgc cct tcg ggg cac gct gtg 1104
121 Lys Gly Ser Ser Gly Gly Pro Leu Leu Cys Pro Ser Gly His Ala Val
W--> 122 355 360 365

E--> 124 ggc atc ttc cgg gct gct gtg tgc acc cgg ggg gtt gca aaa gcg gtg 1152
125 Gly Ile Phe Arg Ala Ala Val Cys Thr Arg Gly Val Ala Lys Ala Val
W--> 126 370 375 380

E--> 128 gac ttc ata cct gtt gag tct atg gaa act acc atg cgg act agt agc 1200
129 Asp Phe Ile Pro Val Glu Ser Met Glu Thr Thr Met Arg Thr Ser Ser
W--> 130 385 390 395 400

E--> 132 gct tgg cgt cac ccg cag ttc ggt ggt taa 1230
133 Ala Trp Arg His Pro Gln Phe Gly Gly *
E--> 134 405
RAW SEQUENCE LISTING ERROR SUMMARY

PATENT APPLICATION: US/10/017,736C

DATE: 02/12/2004
TIME: 11:19:38

Input Set : A:\13_082 Substitute Sequence listing US.txt
Output Set: N:\CRF4\02122004\J017736C.raw

Invalid Line Length: 

The rules require that a line not exceed 72 characters in length. This includes spaces. 

Seq#:1; Line(s) 117
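
The 72-character rule quoted above is easy to check before submission. The following Python sketch is an illustration only: the function name `overlong_lines` is hypothetical, and it numbers lines from 1 in the order they appear in the text, which may not match the numbering used in this report.

```python
MAX_LINE_LENGTH = 72  # limit quoted in the error summary; spaces count


def overlong_lines(listing: str, limit: int = MAX_LINE_LENGTH) -> list[tuple[int, int]]:
    """Return (line_number, length) for every line longer than `limit`."""
    return [
        (number, len(line))
        for number, line in enumerate(listing.splitlines(), start=1)
        if len(line) > limit
    ]
```

A nucleotide line that also carries amino-acid text after its running total, as line 117 of this input appears to, would exceed the limit and be reported by such a check.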
VERIFICATION SUMMARY

PATENT APPLICATION: US/10/017,736C

DATE: 02/12/2004
TIME: 11:19:38

Input Set : A:\13_082 Substitute Sequence listing US.txt
Output Set: N:\CRF4\02122004\J017736C.raw

L:15 M:270 C: Current Application Number differs, Replaced Current Application No 

L:15 M:271 C: Current Filing Date differs, Replaced Current Filing Date 

L:117 M:334 W: (2) Invalid Amino Acid in Coding Region, NUMBER OF INVALID KEYS:17 

L:118 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:1 

L:120 M:254 E: No. of Bases conflict, LENGTH : Input : 1104 Counted: 1056 SEQ:1 

L:122 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:1 

M:254 Repeated in SeqNo=1

L:126 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:1 

L:130 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:1 

L:134 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:1 

L:134 M:252 E: No. of Seq. differs, <211> LENGTH: Input : 1230 Found:1182 SEQ:1 

L:1076 M:258 W: Mandatory Feature missing, <220> Tag not found for SEQ ID#:19 

L:1079 M:258 W: Mandatory Feature missing, <220> Tag not found for SEQ ID#:19 

L:1083 M:258 W: Mandatory Feature missing, <220> Tag not found for SEQ ID#:19 

L:1084 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:19 after pos.:0 

L:1102 M:258 W: Mandatory Feature missing, <220> Tag not found for SEQ ID#:20 

L:1103 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:20 after pos.:0
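
The M:254 base-count conflict reported above can be illustrated with a short Python sketch. It assumes, hypothetically, that a nucleotide line consists of lowercase base groups followed by a single running total, and it compares that total with the bases counted so far. The function name `base_count_conflicts` is invented for the example, and whether the actual checker works this way is an assumption.

```python
import re


def base_count_conflicts(sequence_lines: list[str]) -> list[tuple[int, int, int]]:
    """Compare the running total at the end of each nucleotide line with
    the number of bases counted so far.

    Returns (line_index, stated_total, counted_total) for each mismatch.
    """
    conflicts = []
    counted = 0
    for index, line in enumerate(sequence_lines):
        tokens = line.split()
        # Assumed shape of a nucleotide line: base groups, then one integer.
        if len(tokens) < 2 or not tokens[-1].isdigit():
            continue
        groups = tokens[:-1]
        if not all(re.fullmatch(r"[acgtun]+", g) for g in groups):
            continue  # e.g. a row of residue position numbers
        counted += sum(len(g) for g in groups)
        stated = int(tokens[-1])
        if stated != counted:
            conflicts.append((index, stated, counted))
    return conflicts
```

Under this sketch, a nucleotide line with trailing amino-acid text is not recognized as a nucleotide line, so its 48 bases go uncounted and the next line's stated total runs 48 ahead of the count. That is consistent with the reported Input: 1104 against Counted: 1056, although it is only one plausible reading of the report.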






